Unicode symbols

Character Types

Scripts

Symbols

Other Topics

In computing, in addition to encoding characters for the various writing systems used throughout the World, Unicode also devotes several blocks of characters to symbols that have a well-defined place in plain text. In Unicode there is a main distinction between "scripts" and "symbols". A character is either part of "script" or of a list of "symbols". Unicode's "Special characters", i.e. with Unicode a specified behaviour like in line-breaking, are also Symbols.

Many of the symbols are drawn from existing character sets or ISO or other national and international standards. As stated in the Unicode Standard 5.0[1], “The universe of symbols is rich and open-ended.” This makes the issue of what symbols to encode and how symbols should be encoded more complicated than the issues surrounding alphabets, syllabaries, logographies, and other writing systems. Typically Unicode has sought to encode symbols that have clear roots in national and international standards. Similarly, it focuses on symbols that make sense in a one-dimensional plain text context. For example, Unicode cites the typical two-dimensional arrangement of electronic diagram symbols as the reason for not including those in the characters set [2]. Of course for adequate treatment in plain text, symbols must also be largely monochromatic. Even with these limitations—monochromatic, one-dimensional and standards based—the domain of symbols is potentially limitless. Unicode has primarily focused on writing systems, CJK ideographs, and numerals. Two recent symbol genre additions are the Mathematical Alphanumeric Symbols (Unicode 3.1) and Yijing Hexagram Symbols (Unicode 4.0).

Contents

Symbol block list

The following Unicode ranges encode Symbols

See also

Notes

  1. ^ Chapter 12 (p295)
  2. ^ Unicode Standard 5.0; Chapter 12 (p302)

References

External links